Видео с ютуба Quantized Llm
CMU Advanced NLP Fall 2025 (19): Quantization
Model Quantization for efficient deployment with Amazon SageMaker AI | Amazon Web Services
This Training Trick Fixes AI Quantization (3-Bit Secret)
ICQuant: Index Coding enables Low-bit LLM Quantization
Model compression techniques, Quantization, knowledge distillation, Inference latency optimization
INT vs FP: Fine-Grained Low-Bit LLM Quantization
QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs
Quantization of LLM using LLAMA.cpp
AI Optimization Lecture 3: Distillation, Pruning, and Quantization
Ускоренный курс LLM по тонкой настройке | Учебное пособие LLM по тонкой настройке
Extreme Quantization: Creating the Smallest & Dumbest LLM (63MB GPT-2 Model!)
Unleashing the Power of Tiny LLMs: Extreme Quantization Techniques
Extreme Quantization: Building a Tiny, Fun LLM
Unleashing the Power of Tiny LLMs: Extreme Quantization Techniques
Extreme Quantization: Creating the Smallest & Dumbest LLM (63MB Model!)
Extreme Quantization: Creating the Smallest & Dumbest LLM (63MB Model!)
Extreme Quantization: Creating the Smallest and Dumbest LLM (63MB Model!)
Extreme Quantization: Building a Tiny LLM with 4-bit Integers
quantized inclusionAI/Ring-1T iq2 xxs locally